Quantitative analysis of population-scale family trees with millions of relatives.
Authors
Abstract
Family trees have vast applications in multiple fields from genetics to anthropology and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. Here, we collected 86 million profiles from publicly-available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of longevity by inspecting millions of relative pairs and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other datasets with our resource in order to empower studies with population-scale genealogical data.
Similar resources
COVID-19 and the Lived Experience of People Facing it; a Quantitative Study
Aims: Due to the widespread outbreak of COVID-19, thousands of people have died, and millions of people have been infected around the world, putting communities at great risk. The present study assessed the lived experience of people infected by COVID-19. Participants & Methods: This qualitative research with a phenomenological approach was conducted in March 2020 in Boroujerd, Lorestan. Using...
Quantitative Comparison of Tree Pairs Resulted from Gene and Protein Phylogenetic Trees for Sulfite Reductase Flavoprotein Alpha-Component and 5S rRNA and Taxonomic Trees in Selected Bacterial Species
Introduction: FAD is the cofactor of FAD-FR protein family. Sulfite reductase flavoprotein alpha-component is one of the main enzymes of this family. Based on applications of this enzyme in biotechnology and industry, it was chosen as the subject of evolutionary studies in 19 specific species. Method: Gene and protein sequences of sulfite reductase flavoprotein alpha-component, 5S rRNA sequence...
Barriers to Participation of Breast Cancer Patients’ Relatives in Mammographic Screening
Introduction: Breast cancer is the most common female cancer in the world and Iran and the leading cause of cancer death among Iranian women. One way to control this cancer is to get screened and diagnosed early. Given that screening in the general population is not possible, early detection of this cancer in high-risk women is one way to control it. Mammography is one way to diagnose breast ca...
Quantitative Analysis of Genealogy Using Digitised Family Trees
Driven by the popularity of television shows such as Who Do You Think You Are? many millions of users have uploaded their family tree to web projects such as WikiTree [1]. Analysis of this corpus enables us to investigate genealogy computationally. The study of heritage in the social sciences has led to an increased understanding of ancestry and descent [2] but such efforts are hampered by diff...
Journal: Science
Publication date: 2018